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i) System and method for document processing. 

?» A document processing system controls the 
5 priori documents represented in page des- 
CTlotion language form. Documents are rep- 
Ssented by a page description language which 
rSSSiSd so that definition and declaratory 
commands are positioned only i * 
of each distinct document segment Each docu 
ment has prologue sections, which contain defi 
nition and declaratory commands, and content 
portions which contain the specific tokens or 
Commands for defining specific .mages The 
Son and declaratory commands ,n the 
prologue sections of the document are ar 
Sid m a hierarchical tree so that each dsflnl- 
ton and declaratory command has a scope 
Responding to the portion of the hierarchy 
tree subtended by that command A stmcture 
processor handles resource declaration i and I de- 
fSms, dictionary generation^ context decla- 
rations and references to data external to me 
Scument A content processor Presses the 
tokens using the definitions and 
set up by the structure processor, and an Imag 
fng driver module translates the documen : into 
commands suitable for any of several types of 
crinters, as well as for communication by tele- 
Shone ne to a remote device. <?n feature of the 
document processing system is the ab«h£ 'to 
Query a remote device to determine whether ft 
hal the resources required for Posing a 
Scument transmitted in a high level < snorted 
form It then either transmits the document in 
rasTerized bit map form if the required resour- 
Sf^ not available, or ««n^ 
in a higher level encoded form if the required 
resources are available. 
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tured page description lanouace Into a nuZ^X /? rtrans,atin ° documents, represented fn a struc 
BACKGROUND OF THE INVENTION 

10 by preceding each c^ 

withdaisywLlanddotmairprinteTbu ° 

graphical images. nrara .™wasnotwell suited for pnnt.ng documents that combined text and 

15 (a trademark of Adobe Systems Incorporated! an^ M llrnUl li^' I kn ° Wfl exam P |es be ing PostScript 

Another publication concerning POL's a^emreT i£ Soured »tZ I T ^ PUb " Shin9 C ° mpany " 
R. Buckley, published by Simon & Schuste Inc i^I nZ r * J " Harrin 9 ton a "d Robert 

must be inspected in order to c^min ^ .^S^p^^'L"*' °° nlBnta ° f the *»«»* 

J particular document Alternately, it is quite possTwe fo thS f V resources necessary to print a 
th^^^ 

the state of the documents page setup parameters (iT f^J 9 n O " he 1 document in ° rder to determine 
so on) at the beginning of the spec'tdpag^ 

read the entire PDL description I of the do^'Jn^V u °^ ' P ™ controlleror a Print driver program must 
between the beginning ^ 

straightforward, it is wasteful Adobe Svitam inn t * P f ' Wh " e th,s page setu P s <*™ing process is 
this problem. YStem ,na haS proposed P^mming conventions to avoid or reduce 

proposed by ISO), converts doeumentsTervreen^va^letv oTdtffeit (H ndard Pa 9 e Description language 
transmit dooum.nl, to remote d.vloi I ITZtll **""•*»«»« desedptton formats, end also 



SUMMARY OF THE INVENTION 



-s^entd^^^ 

guag. vvhloh Is struotun5 so ,„S„Z3 Za^rT^lTr'"'^ * 3 M3e aeScr *" to ' ^ 
""—ntse^^^^^^^ 
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ha?a scooe corresponding to the portion of the hierarchical tree subtended by that command. 

H^ioitinnc in the hierarchical tree above the specified document portion need to be processed. This is Dotn 

wmwmmm 

transmission costs. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Addittonal objects and features of the invention will be more readily apparent from the following detaBed 
description and appended claims when taken in conjunction with the drawings ^ 

Fiqures 1Aand 1B are block diagrams of a document represented m structured PDLform. 

Figure 2 is a block diagram of the hardware in a document processing system. 

Figure 3 is a block diagram of the software modules in the preferred embod.ment embodi _ ent 
F Sure 4 depicts the primary data structures used by the software modules In the preferred embodiment 
HgurLsSS 

E^hTSSSLm depicting operation of the Content Processor of the preferred embodiment 
SIS a blot; 'diagram ^communication processor software of the preferred embodiment for 

transmitting documents to remote devices. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

Hierarchically Structured Page Desc ription Language 

Referring, to Figures 1Aand 1B, a document 1 00 used by the present invention is represented by a set of 

"p 'level page set 102 which sets up resources, dictionary and external definitions useable by the entire 
d ° C F^fhe purposes of this description, resources, dictionary and external definitions t h "«™ ' ™£ 
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ment oTlZ^Jtl^^ f, T nS - EXtema ' defin,tl0nS reference data structures ^rnal to a docu- 
^moranda 9 96 l ° S ° th3t ' S to be printed at the to " or bottom * and 

In addition to the top level page set 102 are a set of second level of page sets 104. 106 Each pace set 

l^^SJn^fllWH 112 Pa9e S6t Pr0, ° 9Ue 110 p? ° V,deS -so-ce deSiSns 110 

iS^^lT ; d '= ti0 " ar y 9 eneratfo " statements 110-C and external definitions 110-D used bysub- 
* 0 the d ° Cument The bod y of a page set consists of one or more pictures 120. A picture 120 

guous ^^ 

ThJ^H^T ° f 6aC . h , PiChJre 1 20 C ° mpriSeS a " Opti0nal prc, °9 ue 122 fol,owe <» "y a picture body 124 
^eSesK 

n^Jwf segment of the document (e.g., the image elements for one page). Picture prologues 122 

srtTk^Z ^r a " d defin,tions - on statements and context dZZs ul2 

solely by that one picture (i.e., that segment of the document). 

Looking at Figures 1A-1B as a whole, the data structure embodying document 100 is structured hierarch 
^ 9UeS, f° nta,nin9 d6f init '° n 3nd dec,aratorv commands, are positioned only at the beginning ofTch 

tSSE^tT"* M ° re SpeCifica ">'' each document has °P«°"al Prologue secOons Sconiin 
for SSrlc, V T! ndS ' 3nd C ° ntent P ° rtionS Which contain the s P ecific to ^ns or commands 
tnf " ma9eS - ^. rtnermore ' the de,initi °n and declaratory commands in the prologue sections 

of the document are arranged in a hierarchical tree so that each definition and declaratory command has a 
tte^Z'Z 1V° *? P ° rti0n ° f H he hierarChiCa ' free SUbtend6d by that command - ThS TnstSnoe 
set and TZZ h I ? C °T andS in the P rolo 9 ue <* P^e Set 1 apply only to the pictures in that page 
set and therefore do not apply to the pictures in Page Set N. 

A formal definition of the data structure of a document Is shown in Table 1. 



TABLE 1 
PDL Definitions 
document (pageset | picture) 

pageeet (prologue)?, <pageeet_body> ? 

prologue : t = (exteraal.decl*, informative decl* 
reeource def*. reeource_decl*, doc_prod inst decl* 
contended*, dict_gen__decl*, B et_up_proc*) 

pageBet_body (pageeet | picture) 

picture .: = (prologue)?, (picturejbody) ? 

picture.body :: = (picture | tokensequence ) * 

, All must occur in the order shown. 

I One and only one must occur. 

? Optional (0 or 1 time) 

* Optional and repeatable (0 or more times) 



Image Processor Hardware 



T K • thS ' ma9e Processor 150 of th e Present Invention Is preferably implemented as a 

SSSTLST'' ^ 3 ° enlral Pr ° CeSSln9 Unit < CPU > 152 such as the AMD 29000 madefy 

Advanced M.cro Dev.ces or any of the Motorola 68000 series microprocessor, and random access memory 
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Sr 1m 164. and host Interface 186 couples the Image Processor 10 e host computer 170. such » a 

hard disk and LkTntroller 174for storing items such as documents, font data, and data structures referenced 
b^terna^Sion statements; and a communication interface 160 or modem for telecommun.cat.on w.th 
remove devices such as facsimile machines and remotely located computers and printers 

The image processor 150 of the present invention is presented here as a stand a one ^^Itable for 
use as a peripheral device to one or more computers, such as in the print server of a local area network or as 
l^^tonoMe incorporated Into a computer. However, the image processor 150 may also be used 
as the engine/controller of a "PDL compatible- laser or Inkjet printer. 

Image Processor Software Modules 

Referring to Figure 3, the document to be handled by the Image Processor 1 50 is typically , gc '^ted by 

~-r^r,eirr= 

MMna Jolmlnt elements to Structure Processor 202. Structure Processor 202 interprets or decodes struc- 
ZaSb^SwauBPro^^ 

™r«Sr«.« P— » converts to«. into Imacir* «uc£n. 

^X^iSl^ remote device is a PDLcompatib.e printer), orwlll convertthe tokens .nto .mag.ng 

and aTso handles the task of rasterizing documents being sentto devices that cannot accept higher level print 
Se^^S^unlorf™ processor 208 coordinates the transmission of dooumMrt. ^ ""J 
dlvicea Finallv option processor 21 0 handles document options parameters from the host computer typically 
c^I^R^SX elements near the beginning of the document. Options parameters con trol such 
Sel as tr,; Tprinter to be used, whether the document is to be transmitted onto a telecommun.cat.on line, how 
to handle non-fatal syntax errors in the PDL command stream, and so on. 

Image Processor Data Structures 

Referring to Figure 4, the following data structures are used by the image jproce ssor to < stc .re ;^n«Uon 

..sins of storing resource definitions and declarations, which are then used by th. Oontant Processor wM. 
^SaT™^ 

S pnSlo pmcSsing pages of tfe document in that pageset.That dictionary would be popped off the stack 
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fted" in ™™ 5 f , P39 ! S6t I 6 "f 6 . d a " d P rocessi "9 of ^her Page set began, because the dictionary de- 
fined in page set 1 is not applicable to the other page sets. 

Table 2 represents an example of a very short document using a page description lanauaae structurprf in 
to Hey ™ues 8 V ° thereb/ Ch9n9ln9 th9 ° rd8r in Which the d ^°naries are searched 
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TABLE 2 
Example of Document 

< 1 DOC TYPE SPDL PUBLIC "XXX"> 
<SPDIj> 

< comment > 

Test File Header 

Copyright 1991 Ricoh Corporation 
All Riflhts Reserved 
Confidential and Proprietary 

pile s ctxtdcll.pro 

Author » Tatsuro Motoyama 

Version t 0.01 

File Created: June 5, 1991 
First Draft : June 5, 1991 

Update History: 

^■^ru a test asiss-ASSt" 

Bote: Put PSEUW codes before each SPDL test file. 

- SPDL PSEODO CODE LISTING - 

document- Picture * three dictionaries are defined 

Dictionary generator size J 

ssss tsss sssss. :u . 

2 3 a expect the a in the gamma to he executed 

picture 

Pr con?ext decl gamma beta alpha 

Pi toXense«x 2 3 a Expect the a in the alpha to be executed 

</ comment > 

<document> 

<pictur« spdlid--SPDL- 

Cnt n c ^nt>".i"ld r id e and cntnttyp are Public Object ID values 

< /comment > 
<prologue> 

<diC <dic?oen B ize-»3»><dictid><name>alpha</aame></dictid> 

^Th.'operand stack has the dictionary reference 
dup /a {add} put 
dup /d tdlv> put 
dup /m tinul) put 
</tok.enBeqx/dictsen> 
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d «P /a (div) put 
*up /a {aiul) put 
dup /a (add) put 
dup /» {sub} put 
</tokenseo> 
</dictgen> 

dup /a {aul} put 
«UP /d fadd} put 
dup /m {sub} put 
</tokenseq> 
<tokenseg> 

dup /a (div> put 
dup /c (cob} put 
</tokenseq> 
</dictgen> 
</dictgens> 
<cosnaent> 

dictionary stack 

beta 
alpha 

< /eoasnent> 
</prologue> 
<picbody> 

<tokens«q> 

</toLL:,> %exp " ct tbe *— »• s 

<picture> 

<prologue> 

<etactdecl> 

< ™ > «^</™>^>^t.</n W ><a m > alpta</oiUM> 

</proj,ogue> 
<conflnent> 

dictionary stack 

alpha 

beta 



<picbody> 

<tokenseg> 

</toLn 8 :c> expaot the Xwlt te »~ 5 

< /picbody> 
</picture> 
</picbody> 

^docSien^ <coaffleat > Printing the first page </c oana ent> 
</SPDI«>- 



I exlcal Analyze r and Parser Process 

as follows. € . „ on „ nthernracess inq on each document, the Lexical Analyzer 

Referring to Figure 5A, prior to performing an ^^^S?i«n to upper ease torm. and compares 
and Parser reads the first nine characters of '^^^^^Zd Parser concludes that document 
that string with "<IDOCTYPE». If there ,s a flag , otherwise it concludes that the 

is encoded in dear text form and sets ^^^^^^JJ^. There is a one-to-one cor- 
document is encoded in binary forrn and M *^ e *™™2Snat is more compact, while clear text is eas.er 

another document Is received. ana lvzer and parser routine checks to see if the end of the 

docu^r^^^ 

^^-afa^^ 

Saramete P rs, processing jumps to step 274, ^^^^^ are read at step 262. and then 
If the element being processed contains £ J^SJ fl that illegal or Inconsistent parameters 

tested for errors at step 266. If a parameter error *<^»W£ ^ ^ ge p g roce3S0r has been instructed 
values were received), processing of the TJ^^^'^^ioi. If the image processor has 
to "struggle" - i.e., to attempt to process <£^^ is discarded by not processing it further 

been instructed to "struggle" (step 268) he Jaj««J docU ment. Otherwise, if "struggling" has not 
and allowing processing to ^« n *,^ t n ^ n ^^ m ^r;i (one of the image processor's state para- 
(5 been enabled, the image processor's syn tex f e *'" 3 n 9 0 hi ft^er in the documentwill be pnnt- 

d Tnotro re inthee^^ 
so meter is checked at step 274. If it is £^^£J3 ^ ^ 

is not processed further. Otherw.se, the ne xt step sto cnec * bei p rooesse d solely for the purpose 

been set(step 275), which Isastatevanablemd^ 

of converting the document from binary £ cated for converting the element being 

and the document Is currently ^^^^^^^iccm^ H currently in clear text form 

- 
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routines is called at step 284. 9 Description Lavage, and the appropriate one of these 

5 Document Structure Processor 

tions^i^ 

ager 302, information declaration mlnaS 304 r™ h ff" 6 ma " a9er 30 °- external Ascription man- 
« ager 308. document p^^SS^SXiSi^^S^ ^ (W0Urea deC,aratfan man " 

manager 314, setup procedure manager 31^^ °' d,Ct,0nar y stack ""W 312, dictionary generator 
These software modules deSe^o^2d h ?2LSE" TT ^ err0r hand,er 320 - 

uses the stored document product*, Zon^u^o SiJ^ZrTl ' thedooum9 nt structure manager 300 
20 spending to unselected portions tJZlS^^^u^J^. ^ ° f the d0Cument 
soastop r o.detheapprop ri ated lc tiona^o^ 
Document Content Processor 

" token^^^^^ 204 has a token manager 350 which receives each 

by the C^Pnc^^^^T^^ whi f «» token element Is not interpreted 

device capabie of process^ PDLdo^ 
"™^iverMo^^ 

3" can interpret PDL documents) the content token* a « °, V" ^Pable printer (i.e., a printer which 

Driver Module. Finally. If ^S^^^^^^T^ * ^ ^ ndler t0 the ,ma 9 in 9 
can interpret PostScript languag e doc^^^ 
ans (using a simple table look-up conJe^^ 

Imaging Driver Module convere, °" methodology) by the Token Handler350 and then passed to the 

ceeds as follows. Operands for the element a r™h!w ? T 9 ' he reqUlred content Processing pro- 
352. and then the operator p^onofthre ^ 

354. While interpreting token elements th» nZZZ f 7 mter P reted b V operator execution controller 
» Pushed onto th™ stack ^S^^ZT^T^ 354 US8S the P™™*"* Piously 

Processed. The imaging para 9 ^ 

andttaTb?^^^^ 

operator, and sometimes P^SZS^Tir^^ °Tf ^ ^ corres P ondi n° ^"9s of 
erencestoundefinedkeys^rtoa^^ 

Imaging Driver Moduln 

for PostScript printers, HP Laserjet md" t^^muK^nS hT^ Driver pro 9 ra ™ ™ Prided 
ments to remote devices. A bit map printer VorTdoT f^Z^S" ? T PnnterS ' a " d forsendi "9 d °™- 
be sent an image in the form of a rasterSd bit ma A T^SSZ^mrTT* ?" M 
ment Into a bit mapped image, which can then bl £n™22f^ ^ ? nVSrtS 6aCh pase of the doou - 

printer that does not have a X?^^^^^^ SUCh 88 3 Simple dot matri * °' Inkjet 
Printing to an HP Laserjet or com^toZ^Z^^X T, V™ r ° Utine 380 is 3,50 used 
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they arrive. 

Communication Processor 

simile form toaremote facsimile machine.^ 

the resources requiredforproces^ teb(e ^ Q M is . listing of the 

the resources data structure 220 a " d n ^ document Then an inquiry protocol 

resources. Including memory, fonts and so on fo _proces P and querjes tnat devioe to 

module 392 established a communion itaSZ thl. could be done using the protocol for 

^en^^ 

mitting the document to the remote^ 

more compact and therefore more eff .dent ^ Jan send ng a ao document to be easily edited and rev.sed. 
more Information to the receiving device ^^^^e., the document wDl be transmitted to 

mi l5^S — has been des f bed prance toa few S££ 
by the appended claims. 



A document image printing controller. <**<V**™* of ge description language elements 

means for receiving . document JPJJJ^^iTSSlid dMuiMrt Including • hto««*- 
which define said document as a h-mnsWoal J Jee and sectto n S containing im- 

,cally ordered set of prologue sections ^2u«in?nS^S said prologue section are applicable 

stsroS 

^^ntproces.orforprocesslngsaid^ 

corresponding Imaging instructions for ^^SlSJKSi" e.ements, and stores representa- 
a document structure processor only those prologue elements ap- 

e3Ch "an ^^P^^SSS - — n9 S3id ima9in9 inStrUCti ° nS t0 3 Pri " ter 
port corresponding to said predefined printer device. 

tion language commands. 

transmitting said image bit map to a second printer port, 
using a predefined facsimile transmission protocol. 



11 



7. The document image printing controller of claim 1 , wherein 

9. A document Image printing controller, comprising- 
10. A method of printing a document, the steps of the method comprising- 



defined printer device. 

to a second printer port, 
simile transmission protocol. 

affirmatively to said querying by said protocol means. 
14 The method of claim 10. wherein said predefined printer device is configured to receive documents in 

^converting said content sections of said received document into corresponding PostScript com- 
^'irrnsmitting said corresponding PostScript commands to said predefined printer device. 

corresponding imaging instructions for a predefined P*JrdWj^ ^ sq 

said predefined printer device. 
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RESOURCE DECLARATION 
DICTIONARY GENERATOR 



RESOURCE DEFINITION 
RESOURCE DECLARATION" 
CONTEXT DECLARATION ~ 
DICTIONARY GENERATOR 



FIGURE 1A 




FIGURE 2 
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Portion Of Document Tree 
Subtended By Prologue Of 
Page Set 1 



FIGURE IB 
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IMAGE PROCESSOR 
150 



Imaging Driver Module 
^ £fifi (Figure 8) 



Printer A Printer B Printer C 

FIGURE 3 



Comm un ication Processed 
2Q8. (Figure 9) J 

Telecommunications Line 



L 




/ Structure Processor 






Content Processor) *^ 


► 1 





Imaging Driver Module 



FIGURE 4 
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Dictionary 
Stack 



Dictionaries 



Operand Stack | 
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State Parameters, 



Imaging Parameters 
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o r ^oH Rtmr t^i Ptamant FrnmfitH.nH.rfi Processor 



Read First 9 Bytes of Document 
and Convert to Upper Case 



I Analyzer 
Pflt-ftPr Process 



^260 



Set Input Type = Binary] 

1 



<P^XT^>^£g-w [set input Type = Clear Text \ 

FIGURE 5 A 



•390 




RESOURCE 
REQUIREMENT TABLE 
FOR TRANSMITTING 
DOCUMENT IN PPL FORM 



FACSIMILE G3 & G4 
TRANSMISSION 
HANDLER 



FIGURE 9 
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E 8 .! 8 i? 0 ™ HoSf nnrr ini^r C ommn n f rflt i» n , P mrr ~ nr 




End Of 
Routine 



^2 54 



Read One Structure/Content Element 
Perform Syntax & Semantic Analog 



Lexinal An fl t^p, 
& Parser Pr^ aw 




Convert Parameter Values to 
Predefined Internal Format 



Call Specified Structure Processor Function 
with Required Parameter Values 



After Processing 
, Element 



FIGURE 5B 
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STRUCTURE PROCESSOR 



Document Structure Manager • 
r^tmk Global/Local Scope of Structure Items 



External Declaration Manager c ^o ma i 
Controls References to Items Stored External 
To Document Being Processed 



Information Declaration Manager npscriDtior 
- Discards User Comments Embedded in PPL Descriptor 



Resource Definition Manager 

Defines Fonts And Other Resources Needed By 



Document 



Resource Declaration Manager 
- Allows Use of Specified Resources, Assigns Each 
Resource a Reference Name 



Document Production Instruction Manager 
- Controls Page Selection, Commands to Printer tor 
Options Such as Paper Tray Selection, 
Dual Side Printing, Struggling (i.e.. Ignoring Non-Fatal 
Errors in PPL Description of Document) 



P^J^gJgUrt- From Dictionary Smo* 



"^gtggSffSK, Vaiue Dictionary Tab.es 



Setup Procedure Manager , 
■ Controls State Variables, Such as Printable Area ot 
Page, Selecting Ink Color, etc. _ 



^s S s3ns Content Data) t0 Content Processor. 
Resolves References to External Items by Reading 
and tserirng Them Into Content Pata Stream Before 
Passinn Pata to Content Processor. 



Structure Error Handler 
- Controls Error Handling 



FIGURE 6 
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TOKENS PASSED BY 
STRUCTURE PROCESSOR 




DICTIONARY 
HANDLER 



CONTENT 
ERROR 



Undefined Key, i HANDLER 

Reference to Nonexistent V LER 
Dictionary 

FIGURE 7 



VALUES PASSED BY 
STRUCTURE AND 
CONTENT PROCESSORS 



State Parameters, 



Imaging Parameters 
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© resources are available. 
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